Overview

Dataset statistics

Number of variables: 21
Number of observations: 21
Missing cells: 16
Missing cells (%): 3.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.6 KiB
Average record size in memory: 615.4 B
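
The missing-cell percentage above can be recomputed from the other counts. The following is a minimal sketch, assuming the percentage is the number of missing cells divided by rows times columns; the function name `missing_cell_pct` is hypothetical and not part of pandas-profiling.

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells in an n_rows x n_cols table, in percent."""
    return 100.0 * missing_cells / (n_rows * n_cols)


# Counts taken from the dataset statistics above.
pct = missing_cell_pct(missing_cells=16, n_rows=21, n_cols=21)
print(f"{pct:.1f}%")
```

All 16 missing cells sit in the `note` column, which is why that single column reports 76.2% missing while the table as a whole reports 3.6%.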

Variable types

NUM: 17
CAT: 4

Warnings

data has constant value "2020-11-08T17:00:00" (21 rows) Constant
stato has constant value "ITA" (21 rows) Constant
terapia_intensiva is highly correlated with ricoverati_con_sintomi and 9 other fields High correlation
ricoverati_con_sintomi is highly correlated with terapia_intensiva and 5 other fields High correlation
totale_ospedalizzati is highly correlated with ricoverati_con_sintomi and 5 other fields High correlation
isolamento_domiciliare is highly correlated with ricoverati_con_sintomi and 9 other fields High correlation
totale_positivi is highly correlated with ricoverati_con_sintomi and 9 other fields High correlation
variazione_totale_positivi is highly correlated with isolamento_domiciliare and 2 other fields High correlation
nuovi_positivi is highly correlated with terapia_intensiva and 4 other fields High correlation
dimessi_guariti is highly correlated with ricoverati_con_sintomi and 5 other fields High correlation
deceduti is highly correlated with terapia_intensiva and 3 other fields High correlation
casi_da_sospetto_diagnostico is highly correlated with terapia_intensiva and 5 other fields High correlation
totale_casi is highly correlated with ricoverati_con_sintomi and 10 other fields High correlation
tamponi is highly correlated with isolamento_domiciliare and 3 other fields High correlation
casi_testati is highly correlated with terapia_intensiva and 4 other fields High correlation
note is highly correlated with denominazione_regione High correlation
denominazione_regione is highly correlated with note High correlation
note has 16 (76.2%) missing values Missing
note is uniformly distributed Uniform
codice_regione has unique values Unique
denominazione_regione has unique values Unique
lat has unique values Unique
long has unique values Unique
ricoverati_con_sintomi has unique values Unique
totale_ospedalizzati has unique values Unique
isolamento_domiciliare has unique values Unique
totale_positivi has unique values Unique
variazione_totale_positivi has unique values Unique
nuovi_positivi has unique values Unique
dimessi_guariti has unique values Unique
deceduti has unique values Unique
casi_da_sospetto_diagnostico has unique values Unique
casi_da_screening has unique values Unique
totale_casi has unique values Unique
tamponi has unique values Unique
casi_testati has unique values Unique
casi_da_screening has 1 (4.8%) zeros Zeros
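
Some of the high-correlation warnings may simply reflect columns that are sums of other columns. The sketch below checks that reading against the Sum values reported for each variable in the Variables section. The additive relationships are an assumption inferred from the column names, not something the report states, and agreement of the column totals does not prove that they hold row by row.

```python
# Column sums copied from the "Sum" rows of the Variables section of this report.
report_sums = {
    "ricoverati_con_sintomi": 26440,
    "terapia_intensiva": 2749,
    "totale_ospedalizzati": 29189,
    "isolamento_domiciliare": 529447,
    "totale_positivi": 558636,
    "dimessi_guariti": 335074,
    "deceduti": 41394,
    "casi_da_sospetto_diagnostico": 601403,
    "casi_da_screening": 333701,
    "totale_casi": 935104,
}

# Assumed additive relationships (hypothesised from the column names).
assumed_identities = [
    ("totale_ospedalizzati", ["ricoverati_con_sintomi", "terapia_intensiva"]),
    ("totale_positivi", ["totale_ospedalizzati", "isolamento_domiciliare"]),
    ("totale_casi", ["totale_positivi", "dimessi_guariti", "deceduti"]),
    ("totale_casi", ["casi_da_sospetto_diagnostico", "casi_da_screening"]),
]


def identity_holds(total, parts, sums):
    """True if the total column's sum equals the sum of its assumed parts."""
    return sums[total] == sum(sums[p] for p in parts)


results = [identity_holds(t, p, report_sums) for t, p in assumed_identities]
for (total, parts), ok in zip(assumed_identities, results):
    print(f"{total} == {' + '.join(parts)}: {ok}")
```

If the relationships do hold per row, dropping the derived totals before modelling would remove much of the reported collinearity.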

Reproduction

Analysis started: 2020-11-08 19:12:56.582784
Analysis finished: 2020-11-08 19:13:36.395962
Duration: 39.81 seconds
Software version: pandas-profiling v2.9.0
Download configuration: config.yaml
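
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a small sketch using only the standard library; the timestamp format string is an assumption based on how the values are printed above.

```python
from datetime import datetime

# Format assumed from the printed timestamps (microsecond precision).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2020-11-08 19:12:56.582784", FMT)
finished = datetime.strptime("2020-11-08 19:13:36.395962", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```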

Variables

data
Categorical

CONSTANT
REJECTED

Distinct: 1
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 296.0 B

Value | Count | Frequency (%)
2020-11-08T17:00:00 | 21 | 100.0%

Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 19
Median length: 19
Mean length: 19
Min length: 19

Overview of Unicode Properties

Unique unicode characters: 8
Unique unicode categories: 4
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

Value | Count | Frequency (%)
0 | 147 | 36.8%
1 | 63 | 15.8%
2 | 42 | 10.5%
- | 42 | 10.5%
: | 42 | 10.5%
8 | 21 | 5.3%
T | 21 | 5.3%
7 | 21 | 5.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 294 | 73.7%
Dash Punctuation | 42 | 10.5%
Other Punctuation | 42 | 10.5%
Uppercase Letter | 21 | 5.3%
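
The character and category counts above can be reproduced with the standard library. The sketch below assumes the counts are taken over the constant value repeated once per row; it is an illustration, not the code pandas-profiling itself runs. `unicodedata.category` returns two-letter codes, which correspond to the names in the report as follows: `Nd` is Decimal Number, `Pd` is Dash Punctuation, `Po` is Other Punctuation, and `Lu` is Uppercase Letter.

```python
from collections import Counter
import unicodedata

value = "2020-11-08T17:00:00"  # the constant value of `data`
n_rows = 21                    # number of observations

# Concatenate the value once per row, then count characters and categories.
text = value * n_rows
char_counts = Counter(text)
category_counts = Counter(unicodedata.category(ch) for ch in text)

for ch, count in char_counts.most_common():
    print(f"{ch} | {count} | {100 * count / len(text):.1f}%")
print(dict(category_counts))
```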
 

Most frequent Decimal Number characters

Value | Count | Frequency (%)
0 | 147 | 50.0%
1 | 63 | 21.4%
2 | 42 | 14.3%
8 | 21 | 7.1%
7 | 21 | 7.1%

Most frequent Dash Punctuation characters

Value | Count | Frequency (%)
- | 42 | 100.0%

Most frequent Uppercase Letter characters

Value | Count | Frequency (%)
T | 21 | 100.0%

Most frequent Other Punctuation characters

Value | Count | Frequency (%)
: | 42 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 378 | 94.7%
Latin | 21 | 5.3%

Most frequent Common characters

Value | Count | Frequency (%)
0 | 147 | 38.9%
1 | 63 | 16.7%
2 | 42 | 11.1%
- | 42 | 11.1%
: | 42 | 11.1%
8 | 21 | 5.6%
7 | 21 | 5.6%

Most frequent Latin characters

Value | Count | Frequency (%)
T | 21 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 399 | 100.0%

Most frequent ASCII characters

Value | Count | Frequency (%)
0 | 147 | 36.8%
1 | 63 | 15.8%
2 | 42 | 10.5%
- | 42 | 10.5%
: | 42 | 10.5%
8 | 21 | 5.3%
T | 21 | 5.3%
7 | 21 | 5.3%
 

stato
Categorical

CONSTANT
REJECTED

Distinct: 1
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 296.0 B

Value | Count | Frequency (%)
ITA | 21 | 100.0%

Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 3
Unique unicode categories: 1
Unique unicode scripts: 1
Unique unicode blocks: 1

Most occurring characters

Value | Count | Frequency (%)
I | 21 | 33.3%
T | 21 | 33.3%
A | 21 | 33.3%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 63 | 100.0%

Most frequent Uppercase Letter characters

Value | Count | Frequency (%)
I | 21 | 33.3%
T | 21 | 33.3%
A | 21 | 33.3%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 63 | 100.0%

Most frequent Latin characters

Value | Count | Frequency (%)
I | 21 | 33.3%
T | 21 | 33.3%
A | 21 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 63 | 100.0%

Most frequent ASCII characters

Value | Count | Frequency (%)
I | 21 | 33.3%
T | 21 | 33.3%
A | 21 | 33.3%
 

codice_regione
Real number (ℝ≥0)

UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.85714286
Minimum: 1
Maximum: 22
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 7
median: 12
Q3: 17
95-th percentile: 21
Maximum: 22
Range: 21
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 6.428730157
Coefficient of variation (CV): 0.5421820614
Kurtosis: -1.10272247
Mean: 11.85714286
Median Absolute Deviation (MAD): 5
Skewness: -0.09665560827
Sum: 249
Variance: 41.32857143
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
22 | 1 | 4.8%
11 | 1 | 4.8%
2 | 1 | 4.8%
3 | 1 | 4.8%
5 | 1 | 4.8%
6 | 1 | 4.8%
7 | 1 | 4.8%
8 | 1 | 4.8%
9 | 1 | 4.8%
10 | 1 | 4.8%
12 | 1 | 4.8%
21 | 1 | 4.8%
13 | 1 | 4.8%
14 | 1 | 4.8%
15 | 1 | 4.8%
16 | 1 | 4.8%
17 | 1 | 4.8%
18 | 1 | 4.8%
19 | 1 | 4.8%
20 | 1 | 4.8%
1 | 1 | 4.8%

Value | Count | Frequency (%)
1 | 1 | 4.8%
2 | 1 | 4.8%
3 | 1 | 4.8%
5 | 1 | 4.8%
6 | 1 | 4.8%
7 | 1 | 4.8%
8 | 1 | 4.8%
9 | 1 | 4.8%
10 | 1 | 4.8%
11 | 1 | 4.8%

Value | Count | Frequency (%)
22 | 1 | 4.8%
21 | 1 | 4.8%
20 | 1 | 4.8%
19 | 1 | 4.8%
18 | 1 | 4.8%
17 | 1 | 4.8%
16 | 1 | 4.8%
15 | 1 | 4.8%
14 | 1 | 4.8%
13 | 1 | 4.8%
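
The descriptive statistics for `codice_regione` can be recomputed from the 21 values listed in the value table. The sketch below uses only the standard library and assumes the report uses the sample variance (n - 1 denominator) and the median of absolute deviations from the median for MAD; both assumptions are consistent with the reported figures.

```python
import statistics

# The 21 region codes from the value table (1 to 22, with 4 absent).
codes = [1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12,
         13, 14, 15, 16, 17, 18, 19, 20, 21, 22]

mean = statistics.mean(codes)
variance = statistics.variance(codes)  # sample variance, n - 1 denominator
stdev = statistics.stdev(codes)        # square root of the sample variance
cv = stdev / mean                      # coefficient of variation
median = statistics.median(codes)
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(x - median) for x in codes)

print(sum(codes), mean, variance, stdev, cv, median, mad)
```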
 

denominazione_regione
Categorical

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 296.0 B

Value | Count | Frequency (%)
Campania | 1 | 4.8%
Piemonte | 1 | 4.8%
Puglia | 1 | 4.8%
Friuli Venezia Giulia | 1 | 4.8%
Basilicata | 1 | 4.8%
Umbria | 1 | 4.8%
Toscana | 1 | 4.8%
Marche | 1 | 4.8%
Lazio | 1 | 4.8%
P.A. Trento | 1 | 4.8%
Emilia-Romagna | 1 | 4.8%
Sicilia | 1 | 4.8%
Liguria | 1 | 4.8%
Calabria | 1 | 4.8%
P.A. Bolzano | 1 | 4.8%
Valle d'Aosta | 1 | 4.8%
Abruzzo | 1 | 4.8%
Sardegna | 1 | 4.8%
Veneto | 1 | 4.8%
Lombardia | 1 | 4.8%
Molise | 1 | 4.8%
 
Frequencies of value counts

Unique

Unique: 21
Unique (%): 100.0%
Histogram of lengths of the category

Length

Max length: 21
Median length: 8
Mean length: 8.80952381
Min length: 5
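
The length statistics above follow directly from the 21 region names in the value table. The following is a minimal sketch that recomputes them with the standard library; it illustrates the calculation and is not the code pandas-profiling runs.

```python
import statistics

# Region names as listed in the value table for denominazione_regione.
regions = [
    "Campania", "Piemonte", "Puglia", "Friuli Venezia Giulia", "Basilicata",
    "Umbria", "Toscana", "Marche", "Lazio", "P.A. Trento", "Emilia-Romagna",
    "Sicilia", "Liguria", "Calabria", "P.A. Bolzano", "Valle d'Aosta",
    "Abruzzo", "Sardegna", "Veneto", "Lombardia", "Molise",
]

lengths = [len(name) for name in regions]
max_len = max(lengths)
min_len = min(lengths)
median_len = statistics.median(lengths)
mean_len = statistics.mean(lengths)

print(max_len, median_len, round(mean_len, 8), min_len)
```

The total of 185 characters matches the ASCII block count reported in the Unicode overview for this column.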

Overview of Unicode Properties

Unique unicode characters: 36
Unique unicode categories: 5
Unique unicode scripts: 2
Unique unicode blocks: 1

Most occurring characters

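Value | Count | Frequency (%)
a | 29 | 15.7%
i | 22 | 11.9%
o | 12 | 6.5%
l | 11 | 5.9%
e | 11 | 5.9%
r | 9 | 4.9%
n | 9 | 4.9%
m | 6 | 3.2%
u | 5 | 2.7%
z | 5 | 2.7%
t | 5 | 2.7%
(space) | 5 | 2.7%
A | 4 | 2.2%
b | 4 | 2.2%
s | 4 | 2.2%
c | 4 | 2.2%
g | 4 | 2.2%
P | 4 | 2.2%
. | 4 | 2.2%
V | 3 | 1.6%
L | 3 | 1.6%
d | 3 | 1.6%
B | 2 | 1.1%
C | 2 | 1.1%
M | 2 | 1.1%
Other values (11) | 13 | 7.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 145 | 78.4%
Uppercase Letter | 29 | 15.7%
Space Separator | 5 | 2.7%
Other Punctuation | 5 | 2.7%
Dash Punctuation | 1 | 0.5%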
 

Most frequent Uppercase Letter characters

Value | Count | Frequency (%)
A | 4 | 13.8%
P | 4 | 13.8%
V | 3 | 10.3%
L | 3 | 10.3%
B | 2 | 6.9%
C | 2 | 6.9%
M | 2 | 6.9%
T | 2 | 6.9%
S | 2 | 6.9%
E | 1 | 3.4%
R | 1 | 3.4%
F | 1 | 3.4%
G | 1 | 3.4%
U | 1 | 3.4%

Most frequent Lowercase Letter characters

Value | Count | Frequency (%)
a | 29 | 20.0%
i | 22 | 15.2%
o | 12 | 8.3%
l | 11 | 7.6%
e | 11 | 7.6%
r | 9 | 6.2%
n | 9 | 6.2%
m | 6 | 4.1%
u | 5 | 3.4%
z | 5 | 3.4%
t | 5 | 3.4%
b | 4 | 2.8%
s | 4 | 2.8%
c | 4 | 2.8%
g | 4 | 2.8%
d | 3 | 2.1%
p | 1 | 0.7%
h | 1 | 0.7%

Most frequent Dash Punctuation characters

Value | Count | Frequency (%)
- | 1 | 100.0%

Most frequent Space Separator characters

Value | Count | Frequency (%)
(space) | 5 | 100.0%

Most frequent Other Punctuation characters

Value | Count | Frequency (%)
. | 4 | 80.0%
' | 1 | 20.0%
 

Most occurring scripts

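Value | Count | Frequency (%)
Latin | 174 | 94.1%
Common | 11 | 5.9%

Most frequent Latin characters

Value | Count | Frequency (%)
a | 29 | 16.7%
i | 22 | 12.6%
o | 12 | 6.9%
l | 11 | 6.3%
e | 11 | 6.3%
r | 9 | 5.2%
n | 9 | 5.2%
m | 6 | 3.4%
u | 5 | 2.9%
z | 5 | 2.9%
t | 5 | 2.9%
A | 4 | 2.3%
b | 4 | 2.3%
s | 4 | 2.3%
c | 4 | 2.3%
g | 4 | 2.3%
P | 4 | 2.3%
V | 3 | 1.7%
L | 3 | 1.7%
d | 3 | 1.7%
B | 2 | 1.1%
C | 2 | 1.1%
M | 2 | 1.1%
T | 2 | 1.1%
S | 2 | 1.1%
Other values (7) | 7 | 4.0%

Most frequent Common characters

Value | Count | Frequency (%)
(space) | 5 | 45.5%
. | 4 | 36.4%
- | 1 | 9.1%
' | 1 | 9.1%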
 

Most occurring blocks

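Value | Count | Frequency (%)
ASCII | 185 | 100.0%

Most frequent ASCII characters

Value | Count | Frequency (%)
a | 29 | 15.7%
i | 22 | 11.9%
o | 12 | 6.5%
l | 11 | 5.9%
e | 11 | 5.9%
r | 9 | 4.9%
n | 9 | 4.9%
m | 6 | 3.2%
u | 5 | 2.7%
z | 5 | 2.7%
t | 5 | 2.7%
(space) | 5 | 2.7%
A | 4 | 2.2%
b | 4 | 2.2%
s | 4 | 2.2%
c | 4 | 2.2%
g | 4 | 2.2%
P | 4 | 2.2%
. | 4 | 2.2%
V | 3 | 1.6%
L | 3 | 1.6%
d | 3 | 1.6%
B | 2 | 1.1%
C | 2 | 1.1%
M | 2 | 1.1%
Other values (11) | 13 | 7.0%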
 

lat
Real number (ℝ≥0)

UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 43.04629272
Minimum: 38.11569725
Maximum: 46.49933453
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 38.11569725
5-th percentile: 38.90597598
Q1: 41.12559576
median: 43.61675973
Q3: 45.43490485
95-th percentile: 46.06893511
Maximum: 46.49933453
Range: 8.38363728
Interquartile range (IQR): 4.30930909

Descriptive statistics

Standard deviation: 2.550241402
Coefficient of variation (CV): 0.05924415881
Kurtosis: -0.9632761915
Mean: 43.04629272
Median Absolute Deviation (MAD): 2.03267567
Skewness: -0.4524948372
Sum: 903.9721471
Variance: 6.50373121
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
45.6494354 | 1 | 4.8%
40.83956555 | 1 | 4.8%
44.41149315 | 1 | 4.8%
39.21531192 | 1 | 4.8%
43.10675841 | 1 | 4.8%
43.61675973 | 1 | 4.8%
44.49436681 | 1 | 4.8%
41.55774754 | 1 | 4.8%
46.06893511 | 1 | 4.8%
45.0732745 | 1 | 4.8%
45.46679409 | 1 | 4.8%
46.49933453 | 1 | 4.8%
40.63947052 | 1 | 4.8%
45.73750286 | 1 | 4.8%
41.12559576 | 1 | 4.8%
43.76923077 | 1 | 4.8%
41.89277044 | 1 | 4.8%
42.35122196 | 1 | 4.8%
38.90597598 | 1 | 4.8%
38.11569725 | 1 | 4.8%
45.43490485 | 1 | 4.8%

Value | Count | Frequency (%)
38.11569725 | 1 | 4.8%
38.90597598 | 1 | 4.8%
39.21531192 | 1 | 4.8%
40.63947052 | 1 | 4.8%
40.83956555 | 1 | 4.8%
41.12559576 | 1 | 4.8%
41.55774754 | 1 | 4.8%
41.89277044 | 1 | 4.8%
42.35122196 | 1 | 4.8%
43.10675841 | 1 | 4.8%

Value | Count | Frequency (%)
46.49933453 | 1 | 4.8%
46.06893511 | 1 | 4.8%
45.73750286 | 1 | 4.8%
45.6494354 | 1 | 4.8%
45.46679409 | 1 | 4.8%
45.43490485 | 1 | 4.8%
45.0732745 | 1 | 4.8%
44.49436681 | 1 | 4.8%
44.41149315 | 1 | 4.8%
43.76923077 | 1 | 4.8%
 

long
Real number (ℝ≥0)

UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.22595548
Minimum: 7.320149366
Maximum: 16.86736689
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 7.320149366
5-th percentile: 7.680687483
Q1: 11.12123097
median: 12.38824698
Q3: 13.76813649
95-th percentile: 16.59440194
Maximum: 16.86736689
Range: 9.547217524
Interquartile range (IQR): 2.64690552

Descriptive statistics

Standard deviation: 2.724610671
Coefficient of variation (CV): 0.2228546206
Kurtosis: -0.6130741795
Mean: 12.22595548
Median Absolute Deviation (MAD): 1.37988951
Skewness: -0.1329549478
Sum: 256.7450652
Variance: 7.42350331
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
7.320149366 | 1 | 4.8%
13.5188753 | 1 | 4.8%
9.110616306 | 1 | 4.8%
9.190347404 | 1 | 4.8%
16.59440194 | 1 | 4.8%
15.80514834 | 1 | 4.8%
13.3623567 | 1 | 4.8%
11.12123097 | 1 | 4.8%
11.3417208 | 1 | 4.8%
12.38824698 | 1 | 4.8%
14.65916051 | 1 | 4.8%
12.48366722 | 1 | 4.8%
13.76813649 | 1 | 4.8%
11.35662422 | 1 | 4.8%
11.25588885 | 1 | 4.8%
7.680687483 | 1 | 4.8%
14.25084984 | 1 | 4.8%
12.33845213 | 1 | 4.8%
8.9326992 | 1 | 4.8%
13.39843823 | 1 | 4.8%
16.86736689 | 1 | 4.8%

Value | Count | Frequency (%)
7.320149366 | 1 | 4.8%
7.680687483 | 1 | 4.8%
8.9326992 | 1 | 4.8%
9.110616306 | 1 | 4.8%
9.190347404 | 1 | 4.8%
11.12123097 | 1 | 4.8%
11.25588885 | 1 | 4.8%
11.3417208 | 1 | 4.8%
11.35662422 | 1 | 4.8%
12.33845213 | 1 | 4.8%

Value | Count | Frequency (%)
16.86736689 | 1 | 4.8%
16.59440194 | 1 | 4.8%
15.80514834 | 1 | 4.8%
14.65916051 | 1 | 4.8%
14.25084984 | 1 | 4.8%
13.76813649 | 1 | 4.8%
13.5188753 | 1 | 4.8%
13.39843823 | 1 | 4.8%
13.3623567 | 1 | 4.8%
12.48366722 | 1 | 4.8%
 

ricoverati_con_sintomi
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1259.047619
Minimum: 39
Maximum: 6225
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 39
5-th percentile: 102
Q1: 301
median: 493
Q3: 1474
95-th percentile: 4367
Maximum: 6225
Range: 6186
Interquartile range (IQR): 1173

Descriptive statistics

Standard deviation: 1544.528325
Coefficient of variation (CV): 1.226743375
Kurtosis: 4.998073514
Mean: 1259.047619
Median Absolute Deviation (MAD): 391
Skewness: 2.176431487
Sum: 26440
Variance: 2385567.748
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
255 | 1 | 4.8%
1474 | 1 | 4.8%
353 | 1 | 4.8%
493 | 1 | 4.8%
400 | 1 | 4.8%
484 | 1 | 4.8%
102 | 1 | 4.8%
39 | 1 | 4.8%
233 | 1 | 4.8%
158 | 1 | 4.8%
1836 | 1 | 4.8%
2686 | 1 | 4.8%
301 | 1 | 4.8%
398 | 1 | 4.8%
4367 | 1 | 4.8%
880 | 1 | 4.8%
6225 | 1 | 4.8%
1334 | 1 | 4.8%
1817 | 1 | 4.8%
1250 | 1 | 4.8%
1355 | 1 | 4.8%

Value | Count | Frequency (%)
39 | 1 | 4.8%
102 | 1 | 4.8%
158 | 1 | 4.8%
233 | 1 | 4.8%
255 | 1 | 4.8%
301 | 1 | 4.8%
353 | 1 | 4.8%
398 | 1 | 4.8%
400 | 1 | 4.8%
484 | 1 | 4.8%

Value | Count | Frequency (%)
6225 | 1 | 4.8%
4367 | 1 | 4.8%
2686 | 1 | 4.8%
1836 | 1 | 4.8%
1817 | 1 | 4.8%
1474 | 1 | 4.8%
1355 | 1 | 4.8%
1334 | 1 | 4.8%
1250 | 1 | 4.8%
880 | 1 | 4.8%
 

terapia_intensiva
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 20
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 130.9047619
Minimum: 7
Maximum: 650
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 7
5-th percentile: 14
Q1: 39
median: 69
Q3: 186
95-th percentile: 304
Maximum: 650
Range: 643
Interquartile range (IQR): 147

Descriptive statistics

Standard deviation: 148.3175326
Coefficient of variation (CV): 1.133018619
Kurtosis: 6.950829734
Mean: 130.9047619
Median Absolute Deviation (MAD): 54
Skewness: 2.315001219
Sum: 2749
Variance: 21998.09048
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=20)
Value | Count | Frequency (%)
16 | 2 | 9.5%
226 | 1 | 4.8%
39 | 1 | 4.8%
69 | 1 | 4.8%
7 | 1 | 4.8%
650 | 1 | 4.8%
43 | 1 | 4.8%
45 | 1 | 4.8%
14 | 1 | 4.8%
177 | 1 | 4.8%
304 | 1 | 4.8%
81 | 1 | 4.8%
20 | 1 | 4.8%
62 | 1 | 4.8%
55 | 1 | 4.8%
237 | 1 | 4.8%
185 | 1 | 4.8%
186 | 1 | 4.8%
123 | 1 | 4.8%
194 | 1 | 4.8%

Value | Count | Frequency (%)
7 | 1 | 4.8%
14 | 1 | 4.8%
16 | 2 | 9.5%
20 | 1 | 4.8%
39 | 1 | 4.8%
43 | 1 | 4.8%
45 | 1 | 4.8%
55 | 1 | 4.8%
62 | 1 | 4.8%
69 | 1 | 4.8%

Value | Count | Frequency (%)
650 | 1 | 4.8%
304 | 1 | 4.8%
237 | 1 | 4.8%
226 | 1 | 4.8%
194 | 1 | 4.8%
186 | 1 | 4.8%
185 | 1 | 4.8%
177 | 1 | 4.8%
123 | 1 | 4.8%
81 | 1 | 4.8%
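
`terapia_intensiva` is the only numeric column without the UNIQUE flag, because the value 16 occurs twice. The sketch below rebuilds the column from the value table and recomputes the distinct count and the quartiles. It assumes the report's quartiles use linear interpolation between order statistics, which the standard library exposes as `method="inclusive"`; that assumption matches the figures above.

```python
from collections import Counter
import statistics

# The 21 observations, expanded from the value table (16 appears twice).
icu = [16, 16, 226, 39, 69, 7, 650, 43, 45, 14, 177,
       304, 81, 20, 62, 55, 237, 185, 186, 123, 194]

counts = Counter(icu)
distinct = len(counts)
distinct_pct = 100 * distinct / len(icu)

# Quartiles with linear interpolation between order statistics.
q1, median, q3 = statistics.quantiles(icu, n=4, method="inclusive")
iqr = q3 - q1

print(distinct, f"{distinct_pct:.1f}%", q1, median, q3, iqr)
```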
 

totale_ospedalizzati
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1389.952381
Minimum: 46
Maximum: 6875
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 46
5-th percentile: 118
Q1: 346
median: 562
Q3: 1700
95-th percentile: 4671
Maximum: 6875
Range: 6829
Interquartile range (IQR): 1354

Descriptive statistics

Standard deviation: 1687.635994
Coefficient of variation (CV): 1.21416821
Kurtosis: 5.175375031
Mean: 1389.952381
Median Absolute Deviation (MAD): 444
Skewness: 2.187878396
Sum: 29189
Variance: 2848115.248
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
4671 | 1 | 4.8%
2030 | 1 | 4.8%
1700 | 1 | 4.8%
1540 | 1 | 4.8%
1415 | 1 | 4.8%
455 | 1 | 4.8%
275 | 1 | 4.8%
415 | 1 | 4.8%
2923 | 1 | 4.8%
172 | 1 | 4.8%
527 | 1 | 4.8%
1427 | 1 | 4.8%
1003 | 1 | 4.8%
562 | 1 | 4.8%
2003 | 1 | 4.8%
437 | 1 | 4.8%
118 | 1 | 4.8%
249 | 1 | 4.8%
346 | 1 | 4.8%
6875 | 1 | 4.8%
46 | 1 | 4.8%

Value | Count | Frequency (%)
46 | 1 | 4.8%
118 | 1 | 4.8%
172 | 1 | 4.8%
249 | 1 | 4.8%
275 | 1 | 4.8%
346 | 1 | 4.8%
415 | 1 | 4.8%
437 | 1 | 4.8%
455 | 1 | 4.8%
527 | 1 | 4.8%

Value | Count | Frequency (%)
6875 | 1 | 4.8%
4671 | 1 | 4.8%
2923 | 1 | 4.8%
2030 | 1 | 4.8%
2003 | 1 | 4.8%
1700 | 1 | 4.8%
1540 | 1 | 4.8%
1427 | 1 | 4.8%
1415 | 1 | 4.8%
1003 | 1 | 4.8%
 

isolamento_domiciliare
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25211.7619
Minimum: 1596
Maximum: 125535
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 1596
5-th percentile: 2000
Q1: 7295
median: 9679
Q3: 41747
95-th percentile: 67649
Maximum: 125535
Range: 123939
Interquartile range (IQR): 34452

Descriptive statistics

Standard deviation: 30427.36511
Coefficient of variation (CV): 1.206871825
Kurtosis: 4.958576292
Mean: 25211.7619
Median Absolute Deviation (MAD): 7201
Skewness: 2.046329718
Sum: 529447
Variance: 925824547.4
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
7295 | 1 | 4.8%
9679 | 1 | 4.8%
49430 | 1 | 4.8%
4771 | 1 | 4.8%
8804 | 1 | 4.8%
9385 | 1 | 4.8%
48455 | 1 | 4.8%
2536 | 1 | 4.8%
7972 | 1 | 4.8%
35822 | 1 | 4.8%
2000 | 1 | 4.8%
7550 | 1 | 4.8%
10993 | 1 | 4.8%
125535 | 1 | 4.8%
2478 | 1 | 4.8%
41747 | 1 | 4.8%
49526 | 1 | 4.8%
16184 | 1 | 4.8%
20040 | 1 | 4.8%
1596 | 1 | 4.8%
67649 | 1 | 4.8%

Value | Count | Frequency (%)
1596 | 1 | 4.8%
2000 | 1 | 4.8%
2478 | 1 | 4.8%
2536 | 1 | 4.8%
4771 | 1 | 4.8%
7295 | 1 | 4.8%
7550 | 1 | 4.8%
7972 | 1 | 4.8%
8804 | 1 | 4.8%
9385 | 1 | 4.8%

Value | Count | Frequency (%)
125535 | 1 | 4.8%
67649 | 1 | 4.8%
49526 | 1 | 4.8%
49430 | 1 | 4.8%
48455 | 1 | 4.8%
41747 | 1 | 4.8%
35822 | 1 | 4.8%
20040 | 1 | 4.8%
16184 | 1 | 4.8%
10993 | 1 | 4.8%
 

totale_positivi
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26601.71429
Minimum: 1642
Maximum: 132410
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 1642
5-th percentile: 2172
Q1: 7641
median: 10241
Q3: 43447
95-th percentile: 69652
Maximum: 132410
Range: 130768
Interquartile range (IQR): 35806

Descriptive statistics

Standard deviation: 31982.10567
Coefficient of variation (CV): 1.202257318
Kurtosis: 5.041375532
Mean: 26601.71429
Median Absolute Deviation (MAD): 7587
Skewness: 2.054894224
Sum: 558636
Variance: 1022855083
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
2654 | 1 | 4.8%
7987 | 1 | 4.8%
10241 | 1 | 4.8%
2753 | 1 | 4.8%
43447 | 1 | 4.8%
9800 | 1 | 4.8%
2172 | 1 | 4.8%
1642 | 1 | 4.8%
8427 | 1 | 4.8%
50970 | 1 | 4.8%
51378 | 1 | 4.8%
5020 | 1 | 4.8%
9331 | 1 | 4.8%
69652 | 1 | 4.8%
54197 | 1 | 4.8%
37852 | 1 | 4.8%
12408 | 1 | 4.8%
7641 | 1 | 4.8%
132410 | 1 | 4.8%
21467 | 1 | 4.8%
17187 | 1 | 4.8%

Value | Count | Frequency (%)
1642 | 1 | 4.8%
2172 | 1 | 4.8%
2654 | 1 | 4.8%
2753 | 1 | 4.8%
5020 | 1 | 4.8%
7641 | 1 | 4.8%
7987 | 1 | 4.8%
8427 | 1 | 4.8%
9331 | 1 | 4.8%
9800 | 1 | 4.8%

Value | Count | Frequency (%)
132410 | 1 | 4.8%
69652 | 1 | 4.8%
54197 | 1 | 4.8%
51378 | 1 | 4.8%
50970 | 1 | 4.8%
43447 | 1 | 4.8%
37852 | 1 | 4.8%
21467 | 1 | 4.8%
17187 | 1 | 4.8%
12408 | 1 | 4.8%
 

variazione_totale_positivi
Real number (ℝ)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1242.857143
Minimum: -9
Maximum: 4781
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: -9
5-th percentile: 85
Q1: 281
median: 436
Q3: 2203
95-th percentile: 4146
Maximum: 4781
Range: 4790
Interquartile range (IQR): 1922

Descriptive statistics

Standard deviation: 1446.943651
Coefficient of variation (CV): 1.164207535
Kurtosis: 0.5768657018
Mean: 1242.857143
Median Absolute Deviation (MAD): 294
Skewness: 1.303424349
Sum: 26100
Variance: 2093645.929
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
414 | 1 | 4.8%
300 | 1 | 4.8%
507 | 1 | 4.8%
614 | 1 | 4.8%
356 | 1 | 4.8%
730 | 1 | 4.8%
2342 | 1 | 4.8%
135 | 1 | 4.8%
1864 | 1 | 4.8%
-9 | 1 | 4.8%
4781 | 1 | 4.8%
2203 | 1 | 4.8%
430 | 1 | 4.8%
239 | 1 | 4.8%
3313 | 1 | 4.8%
4146 | 1 | 4.8%
436 | 1 | 4.8%
85 | 1 | 4.8%
281 | 1 | 4.8%
2708 | 1 | 4.8%
225 | 1 | 4.8%

Value | Count | Frequency (%)
-9 | 1 | 4.8%
85 | 1 | 4.8%
135 | 1 | 4.8%
225 | 1 | 4.8%
239 | 1 | 4.8%
281 | 1 | 4.8%
300 | 1 | 4.8%
356 | 1 | 4.8%
414 | 1 | 4.8%
430 | 1 | 4.8%

Value | Count | Frequency (%)
4781 | 1 | 4.8%
4146 | 1 | 4.8%
3313 | 1 | 4.8%
2708 | 1 | 4.8%
2342 | 1 | 4.8%
2203 | 1 | 4.8%
1864 | 1 | 4.8%
730 | 1 | 4.8%
614 | 1 | 4.8%
507 | 1 | 4.8%
 

nuovi_positivi
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1553.142857
Minimum: 55
Maximum: 6318
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 55
5-th percentile: 91
Q1: 424
median: 766
Q3: 2479
95-th percentile: 4601
Maximum: 6318
Range: 6263
Interquartile range (IQR): 2055

Descriptive statistics

Standard deviation: 1725.957134
Coefficient of variation (CV): 1.11126747
Kurtosis: 1.579957059
Mean: 1553.142857
Median Absolute Deviation (MAD): 520
Skewness: 1.477740624
Sum: 32616
Variance: 2978928.029
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
766 | 1 | 4.8%
424 | 1 | 4.8%
3362 | 1 | 4.8%
502 | 1 | 4.8%
359 | 1 | 4.8%
584 | 1 | 4.8%
3884 | 1 | 4.8%
781 | 1 | 4.8%
6318 | 1 | 4.8%
2479 | 1 | 4.8%
55 | 1 | 4.8%
182 | 1 | 4.8%
886 | 1 | 4.8%
2489 | 1 | 4.8%
246 | 1 | 4.8%
504 | 1 | 4.8%
2360 | 1 | 4.8%
4601 | 1 | 4.8%
1083 | 1 | 4.8%
91 | 1 | 4.8%
660 | 1 | 4.8%

Value | Count | Frequency (%)
55 | 1 | 4.8%
91 | 1 | 4.8%
182 | 1 | 4.8%
246 | 1 | 4.8%
359 | 1 | 4.8%
424 | 1 | 4.8%
502 | 1 | 4.8%
504 | 1 | 4.8%
584 | 1 | 4.8%
660 | 1 | 4.8%

Value | Count | Frequency (%)
6318 | 1 | 4.8%
4601 | 1 | 4.8%
3884 | 1 | 4.8%
3362 | 1 | 4.8%
2489 | 1 | 4.8%
2479 | 1 | 4.8%
2360 | 1 | 4.8%
1083 | 1 | 4.8%
886 | 1 | 4.8%
781 | 1 | 4.8%
 

dimessi_guariti
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15955.90476
Minimum: 752
Maximum: 110001
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 752
5-th percentile: 805
Q1: 4592
median: 7668
Q3: 17938
95-th percentile: 38953
Maximum: 110001
Range: 109249
Interquartile range (IQR): 13346

Descriptive statistics

Standard deviation: 23876.36924
Coefficient of variation (CV): 1.496397076
Kurtosis: 12.87212427
Mean: 15955.90476
Median Absolute Deviation (MAD): 5363
Skewness: 3.352890689
Sum: 335074
Variance: 570081007.9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
13023 | 1 | 4.8%
4592 | 1 | 4.8%
28740 | 1 | 4.8%
805 | 1 | 4.8%
9128 | 1 | 4.8%
38953 | 1 | 4.8%
7498 | 1 | 4.8%
25725 | 1 | 4.8%
6636 | 1 | 4.8%
752 | 1 | 4.8%
110001 | 1 | 4.8%
4798 | 1 | 4.8%
17938 | 1 | 4.8%
7668 | 1 | 4.8%
1758 | 1 | 4.8%
4951 | 1 | 4.8%
16441 | 1 | 4.8%
22106 | 1 | 4.8%
3579 | 1 | 4.8%
7677 | 1 | 4.8%
2305 | 1 | 4.8%

Value | Count | Frequency (%)
752 | 1 | 4.8%
805 | 1 | 4.8%
1758 | 1 | 4.8%
2305 | 1 | 4.8%
3579 | 1 | 4.8%
4592 | 1 | 4.8%
4798 | 1 | 4.8%
4951 | 1 | 4.8%
6636 | 1 | 4.8%
7498 | 1 | 4.8%

Value | Count | Frequency (%)
110001 | 1 | 4.8%
38953 | 1 | 4.8%
28740 | 1 | 4.8%
25725 | 1 | 4.8%
22106 | 1 | 4.8%
17938 | 1 | 4.8%
16441 | 1 | 4.8%
13023 | 1 | 4.8%
9128 | 1 | 4.8%
7677 | 1 | 4.8%
 

deceduti
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1971.142857
Minimum: 48
Maximum: 18343
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 48
5-th percentile: 64
Q1: 255
median: 676
Q3: 1561
95-th percentile: 4816
Maximum: 18343
Range: 18295
Interquartile range (IQR): 1306

Descriptive statistics

Standard deviation: 3988.89282
Coefficient of variation (CV): 2.023644712
Kurtosis: 15.71496993
Mean: 1971.142857
Median Absolute Deviation (MAD): 488
Skewness: 3.808401186
Sum: 41394
Variance: 15911265.93
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
255 | 1 | 4.8%
2574 | 1 | 4.8%
451 | 1 | 4.8%
676 | 1 | 4.8%
1413 | 1 | 4.8%
837 | 1 | 4.8%
18343 | 1 | 4.8%
4629 | 1 | 4.8%
140 | 1 | 4.8%
1050 | 1 | 4.8%
4816 | 1 | 4.8%
205 | 1 | 4.8%
465 | 1 | 4.8%
188 | 1 | 4.8%
596 | 1 | 4.8%
341 | 1 | 4.8%
48 | 1 | 4.8%
1561 | 1 | 4.8%
826 | 1 | 4.8%
1916 | 1 | 4.8%
64 | 1 | 4.8%

Value | Count | Frequency (%)
48 | 1 | 4.8%
64 | 1 | 4.8%
140 | 1 | 4.8%
188 | 1 | 4.8%
205 | 1 | 4.8%
255 | 1 | 4.8%
341 | 1 | 4.8%
451 | 1 | 4.8%
465 | 1 | 4.8%
596 | 1 | 4.8%

Value | Count | Frequency (%)
18343 | 1 | 4.8%
4816 | 1 | 4.8%
4629 | 1 | 4.8%
2574 | 1 | 4.8%
1916 | 1 | 4.8%
1561 | 1 | 4.8%
1413 | 1 | 4.8%
1050 | 1 | 4.8%
837 | 1 | 4.8%
826 | 1 | 4.8%
 

casi_da_sospetto_diagnostico
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28638.2381
Minimum: 1231
Maximum: 201314
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 1231
5-th percentile: 1275
Q1: 4760
median: 13126
Q3: 26082
95-th percentile: 84257
Maximum: 201314
Range: 200083
Interquartile range (IQR): 21322

Descriptive statistics

Standard deviation: 44816.88099
Coefficient of variation (CV): 1.564931503
Kurtosis: 11.60222555
Mean: 28638.2381
Median Absolute Deviation (MAD): 9492
Skewness: 3.191577664
Sum: 601403
Variance: 2008552822
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
7615 | 1 | 4.8%
42321 | 1 | 4.8%
201314 | 1 | 4.8%
13126 | 1 | 4.8%
18600 | 1 | 4.8%
3634 | 1 | 4.8%
48938 | 1 | 4.8%
4760 | 1 | 4.8%
25326 | 1 | 4.8%
1231 | 1 | 4.8%
19378 | 1 | 4.8%
4189 | 1 | 4.8%
12787 | 1 | 4.8%
6004 | 1 | 4.8%
20086 | 1 | 4.8%
2392 | 1 | 4.8%
50905 | 1 | 4.8%
7183 | 1 | 4.8%
1275 | 1 | 4.8%
26082 | 1 | 4.8%
84257 | 1 | 4.8%

Value | Count | Frequency (%)
1231 | 1 | 4.8%
1275 | 1 | 4.8%
2392 | 1 | 4.8%
3634 | 1 | 4.8%
4189 | 1 | 4.8%
4760 | 1 | 4.8%
6004 | 1 | 4.8%
7183 | 1 | 4.8%
7615 | 1 | 4.8%
12787 | 1 | 4.8%

Value | Count | Frequency (%)
201314 | 1 | 4.8%
84257 | 1 | 4.8%
50905 | 1 | 4.8%
48938 | 1 | 4.8%
42321 | 1 | 4.8%
26082 | 1 | 4.8%
25326 | 1 | 4.8%
20086 | 1 | 4.8%
19378 | 1 | 4.8%
18600 | 1 | 4.8%
 

casi_da_screening
Real number (ℝ≥0)

UNIQUE
ZEROS

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15890.52381
Minimum: 0
Maximum: 59440
Zeros: 1
Zeros (%): 4.8%
Memory size: 296.0 B

Quantile statistics

Minimum: 0
5-th percentile: 50
Q1: 2292
median: 7501
Q3: 18518
95-th percentile: 55458
Maximum: 59440
Range: 59440
Interquartile range (IQR): 16226

Descriptive statistics

Standard deviation: 19724.99779
Coefficient of variation (CV): 1.2413057
Kurtosis: 0.5785106014
Mean: 15890.52381
Median Absolute Deviation (MAD): 5560
Skewness: 1.413606825
Sum: 333701
Variance: 389075537.7
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
10750 | 1 | 4.8%
6190 | 1 | 4.8%
0 | 1 | 4.8%
55458 | 1 | 4.8%
53187 | 1 | 4.8%
4882 | 1 | 4.8%
2662 | 1 | 4.8%
12041 | 1 | 4.8%
501 | 1 | 4.8%
11104 | 1 | 4.8%
22470 | 1 | 4.8%
189 | 1 | 4.8%
59440 | 1 | 4.8%
7501 | 1 | 4.8%
50 | 1 | 4.8%
2292 | 1 | 4.8%
1941 | 1 | 4.8%
18518 | 1 | 4.8%
6904 | 1 | 4.8%
11893 | 1 | 4.8%
45728 | 1 | 4.8%

Value | Count | Frequency (%)
0 | 1 | 4.8%
50 | 1 | 4.8%
189 | 1 | 4.8%
501 | 1 | 4.8%
1941 | 1 | 4.8%
2292 | 1 | 4.8%
2662 | 1 | 4.8%
4882 | 1 | 4.8%
6190 | 1 | 4.8%
6904 | 1 | 4.8%

Value | Count | Frequency (%)
59440 | 1 | 4.8%
55458 | 1 | 4.8%
53187 | 1 | 4.8%
45728 | 1 | 4.8%
22470 | 1 | 4.8%
18518 | 1 | 4.8%
12041 | 1 | 4.8%
11893 | 1 | 4.8%
11104 | 1 | 4.8%
10750 | 1 | 4.8%
 

totale_casi
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44528.7619
Minimum: 2442
Maximum: 260754
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 2442
5-th percentile: 3523
Q1: 12261
median: 18789
Q3: 65814
95-th percentile: 97779
Maximum: 260754
Range: 258312
Interquartile range (IQR): 53553

Descriptive statistics

Standard deviation: 58158.16131
Coefficient of variation (CV): 1.306080808
Kurtosis: 9.575947159
Mean: 44528.7619
Median Absolute Deviation (MAD): 14654
Skewness: 2.793802567
Sum: 935104
Variance: 3382371727
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
79269 | 1 | 4.8%
2442 | 1 | 4.8%
62946 | 1 | 4.8%
3523 | 1 | 4.8%
31271 | 1 | 4.8%
18789 | 1 | 4.8%
13126 | 1 | 4.8%
86919 | 1 | 4.8%
14728 | 1 | 4.8%
7465 | 1 | 4.8%
10886 | 1 | 4.8%
14939 | 1 | 4.8%
36430 | 1 | 4.8%
25701 | 1 | 4.8%
71408 | 1 | 4.8%
260754 | 1 | 4.8%
97779 | 1 | 4.8%
65814 | 1 | 4.8%
14519 | 1 | 4.8%
12261 | 1 | 4.8%
4135 | 1 | 4.8%

Value | Count | Frequency (%)
2442 | 1 | 4.8%
3523 | 1 | 4.8%
4135 | 1 | 4.8%
7465 | 1 | 4.8%
10886 | 1 | 4.8%
12261 | 1 | 4.8%
13126 | 1 | 4.8%
14519 | 1 | 4.8%
14728 | 1 | 4.8%
14939 | 1 | 4.8%

Value | Count | Frequency (%)
260754 | 1 | 4.8%
97779 | 1 | 4.8%
86919 | 1 | 4.8%
79269 | 1 | 4.8%
71408 | 1 | 4.8%
65814 | 1 | 4.8%
62946 | 1 | 4.8%
36430 | 1 | 4.8%
31271 | 1 | 4.8%
25701 | 1 | 4.8%
 

tamponi
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 827367.2857
Minimum: 45323
Maximum: 3256646
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 45323
5-th percentile: 69856
Q1: 298954
median: 486171
Q3: 1149489
95-th percentile: 2448503
Maximum: 3256646
Range: 3211323
Interquartile range (IQR): 850535

Descriptive statistics

Standard deviation: 841944.6288
Coefficient of variation (CV): 1.01761895
Kurtosis: 2.457299353
Mean: 827367.2857
Median Absolute Deviation (MAD): 266923
Skewness: 1.635314535
Sum: 17374713
Variance: 7.08870758e+11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
1728477 | 1 | 4.8%
608364 | 1 | 4.8%
1646433 | 1 | 4.8%
293794 | 1 | 4.8%
256302 | 1 | 4.8%
45323 | 1 | 4.8%
2448503 | 1 | 4.8%
3256646 | 1 | 4.8%
298954 | 1 | 4.8%
345496 | 1 | 4.8%
115374 | 1 | 4.8%
486171 | 1 | 4.8%
580815 | 1 | 4.8%
308944 | 1 | 4.8%
1149489 | 1 | 4.8%
753094 | 1 | 4.8%
329812 | 1 | 4.8%
1123703 | 1 | 4.8%
315128 | 1 | 4.8%
1214035 | 1 | 4.8%
69856 | 1 | 4.8%

Value | Count | Frequency (%)
45323 | 1 | 4.8%
69856 | 1 | 4.8%
115374 | 1 | 4.8%
256302 | 1 | 4.8%
293794 | 1 | 4.8%
298954 | 1 | 4.8%
308944 | 1 | 4.8%
315128 | 1 | 4.8%
329812 | 1 | 4.8%
345496 | 1 | 4.8%

Value | Count | Frequency (%)
3256646 | 1 | 4.8%
2448503 | 1 | 4.8%
1728477 | 1 | 4.8%
1646433 | 1 | 4.8%
1214035 | 1 | 4.8%
1149489 | 1 | 4.8%
1123703 | 1 | 4.8%
753094 | 1 | 4.8%
608364 | 1 | 4.8%
580815 | 1 | 4.8%
 

casi_testati
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 502240.2857
Minimum: 27399
Maximum: 2003397
Zeros: 0
Zeros (%): 0.0%
Memory size: 296.0 B

Quantile statistics

Minimum: 27399
5-th percentile: 65390
Q1: 189320
median: 248706
Q3: 778735
95-th percentile: 1341488
Maximum: 2003397
Range: 1975998
Interquartile range (IQR): 589415

Descriptive statistics

Standard deviation: 496087.1767
Coefficient of variation (CV): 0.9877486749
Kurtosis: 3.016624186
Mean: 502240.2857
Median Absolute Deviation (MAD): 183316
Skewness: 1.66956899
Sum: 10547046
Variance: 2.461024869e+11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
193373 | 1 | 4.8%
778735 | 1 | 4.8%
202915 | 1 | 4.8%
2003397 | 1 | 4.8%
27399 | 1 | 4.8%
65390 | 1 | 4.8%
189320 | 1 | 4.8%
914731 | 1 | 4.8%
433213 | 1 | 4.8%
114414 | 1 | 4.8%
1341488 | 1 | 4.8%
525013 | 1 | 4.8%
799345 | 1 | 4.8%
130323 | 1 | 4.8%
248373 | 1 | 4.8%
948701 | 1 | 4.8%
726199 | 1 | 4.8%
117848 | 1 | 4.8%
242297 | 1 | 4.8%
295866 | 1 | 4.8%
248706 | 1 | 4.8%

Value | Count | Frequency (%)
27399 | 1 | 4.8%
65390 | 1 | 4.8%
114414 | 1 | 4.8%
117848 | 1 | 4.8%
130323 | 1 | 4.8%
189320 | 1 | 4.8%
193373 | 1 | 4.8%
202915 | 1 | 4.8%
242297 | 1 | 4.8%
248373 | 1 | 4.8%

Value | Count | Frequency (%)
2003397 | 1 | 4.8%
1341488 | 1 | 4.8%
948701 | 1 | 4.8%
914731 | 1 | 4.8%
799345 | 1 | 4.8%
778735 | 1 | 4.8%
726199 | 1 | 4.8%
525013 | 1 | 4.8%
433213 | 1 | 4.8%
295866 | 1 | 4.8%
 

note
Categorical

HIGH CORRELATION
MISSING
UNIFORM

Distinct: 5
Distinct (%): 100.0%
Missing: 16
Missing (%): 76.2%
Memory size: 296.0 B

Value | Count | Frequency (%)
per 820 casi non e' disponibile la provenienza | 1 | 4.8%
Dal totale dei positivi è stato eliminato 1 caso in quanto paziente non covid. | 1 | 4.8%
TOTALE TEST ANTIGENICI RAPIDI EFFETTUATI : 341 - TOTALE TEST ANTIGENICI RAPIDI POSITIVI : 62 - TOTALE POSITIVI AL TEST ANTIGENICO E SINTOMATICI : 9 | 1 | 4.8%
IN CORSO REVISIONE DATI PER TAMPONI TOTALI EFFETTUATI E CASI TESTATI E PER TOTALE CASI POSITIVI RESIDENTI IN REGIONE BASILICATA. | 1 | 4.8%
In seguito a verifica sui dati comunicati nei giorni passati è stato eliminato 1 caso in quanto giudicato non caso COVID-19. | 1 | 4.8%
(Missing) | 16 | 76.2%
 
Frequencies of value counts

Unique

Unique: 5
Unique (%): 100.0%
Histogram of lengths of the category

Length

Max length: 147
Median length: 3
Mean length: 27.19047619
Min length: 3
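
A minimum and median length of 3 looks odd for free-text notes. One plausible explanation, offered here as an assumption rather than something the report states, is that the 16 missing values are converted to the 3-character string "nan" before lengths are measured. The sketch below shows how that would produce the reported minimum and median.

```python
import statistics

# The five non-missing notes, copied from the value table.
notes = [
    "per 820 casi non e' disponibile la provenienza",
    "Dal totale dei positivi è stato eliminato 1 caso in quanto paziente non covid.",
    "TOTALE TEST ANTIGENICI RAPIDI EFFETTUATI : 341 - TOTALE TEST ANTIGENICI "
    "RAPIDI POSITIVI : 62 - TOTALE POSITIVI AL TEST ANTIGENICO E SINTOMATICI : 9",
    "IN CORSO REVISIONE DATI PER TAMPONI TOTALI EFFETTUATI E CASI TESTATI E PER "
    "TOTALE CASI POSITIVI RESIDENTI IN REGIONE BASILICATA.",
    "In seguito a verifica sui dati comunicati nei giorni passati è stato "
    "eliminato 1 caso in quanto giudicato non caso COVID-19.",
]
n_missing = 16

# Assumption: each missing value is measured as the string "nan" (3 characters).
lengths = [len(text) for text in notes] + [len("nan")] * n_missing

min_len = min(lengths)
median_len = statistics.median(lengths)
print(min_len, median_len)
```

Under the same assumption, the character frequency tables below would also include the letters of the 16 "nan" strings, so length and character statistics for this column are best read with the missing values excluded.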

Overview of Unicode Properties

Unique unicode characters: 52
Unique unicode categories: 6
Unique unicode scripts: 2
Unique unicode blocks: 2

Most occurring characters

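Value | Count | Frequency (%)
(space) | 83 | 14.5%
n | 52 | 9.1%
I | 43 | 7.5%
a | 38 | 6.7%
T | 37 | 6.5%
i | 30 | 5.3%
E | 26 | 4.6%
A | 22 | 3.9%
o | 21 | 3.7%
t | 17 | 3.0%
O | 16 | 2.8%
e | 14 | 2.5%
S | 14 | 2.5%
N | 13 | 2.3%
s | 12 | 2.1%
c | 9 | 1.6%
C | 9 | 1.6%
R | 8 | 1.4%
P | 8 | 1.4%
L | 7 | 1.2%
D | 6 | 1.1%
l | 6 | 1.1%
p | 6 | 1.1%
u | 6 | 1.1%
d | 5 | 0.9%
Other values (27) | 63 | 11.0%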

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 238 | 41.7%
Uppercase Letter | 227 | 39.8%
Space Separator | 83 | 14.5%
Decimal Number | 13 | 2.3%
Other Punctuation | 7 | 1.2%
Dash Punctuation | 3 | 0.5%

Most frequent Uppercase Letter characters

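Value | Count | Frequency (%)
I | 43 | 18.9%
T | 37 | 16.3%
E | 26 | 11.5%
A | 22 | 9.7%
O | 16 | 7.0%
S | 14 | 6.2%
N | 13 | 5.7%
C | 9 | 4.0%
R | 8 | 3.5%
P | 8 | 3.5%
L | 7 | 3.1%
D | 6 | 2.6%
V | 5 | 2.2%
F | 4 | 1.8%
G | 4 | 1.8%
M | 2 | 0.9%
U | 2 | 0.9%
B | 1 | 0.4%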

Most frequent Lowercase Letter characters

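Value | Count | Frequency (%)
n | 52 | 21.8%
a | 38 | 16.0%
i | 30 | 12.6%
o | 21 | 8.8%
t | 17 | 7.1%
e | 14 | 5.9%
s | 12 | 5.0%
c | 9 | 3.8%
l | 6 | 2.5%
p | 6 | 2.5%
u | 6 | 2.5%
d | 5 | 2.1%
v | 4 | 1.7%
r | 4 | 1.7%
m | 3 | 1.3%
g | 3 | 1.3%
è | 2 | 0.8%
q | 2 | 0.8%
z | 2 | 0.8%
f | 1 | 0.4%
b | 1 | 0.4%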

Most frequent Space Separator characters

Value | Count | Frequency (%)
(space) | 83 | 100.0%

Most frequent Decimal Number characters

Value | Count | Frequency (%)
1 | 4 | 30.8%
9 | 2 | 15.4%
2 | 2 | 15.4%
3 | 1 | 7.7%
4 | 1 | 7.7%
6 | 1 | 7.7%
8 | 1 | 7.7%
0 | 1 | 7.7%

Most frequent Other Punctuation characters

Value | Count | Frequency (%)
. | 3 | 42.9%
: | 3 | 42.9%
' | 1 | 14.3%

Most frequent Dash Punctuation characters

Value | Count | Frequency (%)
- | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 465 | 81.4%
Common | 106 | 18.6%

Most frequent Latin characters

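Value | Count | Frequency (%)
n | 52 | 11.2%
I | 43 | 9.2%
a | 38 | 8.2%
T | 37 | 8.0%
i | 30 | 6.5%
E | 26 | 5.6%
A | 22 | 4.7%
o | 21 | 4.5%
t | 17 | 3.7%
O | 16 | 3.4%
e | 14 | 3.0%
S | 14 | 3.0%
N | 13 | 2.8%
s | 12 | 2.6%
c | 9 | 1.9%
C | 9 | 1.9%
R | 8 | 1.7%
P | 8 | 1.7%
L | 7 | 1.5%
D | 6 | 1.3%
l | 6 | 1.3%
p | 6 | 1.3%
u | 6 | 1.3%
d | 5 | 1.1%
V | 5 | 1.1%
Other values (14) | 35 | 7.5%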

Most frequent Common characters

Value | Count | Frequency (%)
(space) | 83 | 78.3%
1 | 4 | 3.8%
. | 3 | 2.8%
- | 3 | 2.8%
: | 3 | 2.8%
9 | 2 | 1.9%
2 | 2 | 1.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
6 | 1 | 0.9%
8 | 1 | 0.9%
0 | 1 | 0.9%
' | 1 | 0.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 569 | 99.6%
None | 2 | 0.4%

Most frequent ASCII characters

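Value | Count | Frequency (%)
(space) | 83 | 14.6%
n | 52 | 9.1%
I | 43 | 7.6%
a | 38 | 6.7%
T | 37 | 6.5%
i | 30 | 5.3%
E | 26 | 4.6%
A | 22 | 3.9%
o | 21 | 3.7%
t | 17 | 3.0%
O | 16 | 2.8%
e | 14 | 2.5%
S | 14 | 2.5%
N | 13 | 2.3%
s | 12 | 2.1%
c | 9 | 1.6%
C | 9 | 1.6%
R | 8 | 1.4%
P | 8 | 1.4%
L | 7 | 1.2%
D | 6 | 1.1%
l | 6 | 1.1%
p | 6 | 1.1%
u | 6 | 1.1%
d | 5 | 0.9%
Other values (26) | 61 | 10.7%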

Most frequent None characters

Value | Count | Frequency (%)
è | 2 | 100.0%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only its direction does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
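The calculation described above can be sketched in plain Python as follows. The function name `pearson_r` and the toy inputs are illustrative assumptions, not taken from the report; the normalising factors of the covariance and the standard deviations cancel, so they are omitted.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations; the 1/(n-1) factors cancel.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    x = [1, 2, 3, 4]
    print(pearson_r(x, [2, 4, 6, 8]))       # perfectly increasing line
    print(pearson_r(x, [10, 13, 16, 19]))   # shifted and rescaled, same r
    print(pearson_r(x, [8, 6, 4, 2]))       # perfectly decreasing line
```

The error raised for a constant variable reflects that r is undefined when one standard deviation is zero, as would happen for the constant columns flagged in the warnings.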

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
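A minimal sketch of this rank-based calculation is given below. The helper names `average_ranks` and `spearman_rho` are hypothetical, and assigning tied values the average of their ranks is an assumption of this sketch (a common convention, not something the report states).

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


if __name__ == "__main__":
    # A nonlinear but strictly increasing relationship.
    print(spearman_rho([1, 2, 3, 4, 5], [1, 8, 27, 64, 125]))
```

Because only the ordering matters, the cubic relationship in the usage example is treated the same as a straight line.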

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
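The pair-counting definition can be sketched directly, as below. This is the simple form usually called tau-a; the function name `kendall_tau_a` is hypothetical, and tie-adjusted variants (such as tau-b, which statistical libraries often use) would give different values when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        pairs += 1
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
        # sign == 0 means a tie in x or y; it counts toward neither group
    return (concordant - discordant) / pairs


if __name__ == "__main__":
    print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

In the usage example two of the three pairs are concordant and one is discordant, so the result is (2 - 1) / 3.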

Phik (φk)

Phik (φk) is a practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
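For illustration, a bias-corrected Cramér's V in the spirit of Bergsma's proposal can be sketched as follows. This is a sketch under stated assumptions: the function name `cramers_v_corrected` is hypothetical, the chi-squared statistic is computed without any continuity correction, and returning NaN when the corrected denominator is not positive is a choice made here rather than documented behaviour of the report generator.

```python
import math
from collections import Counter


def cramers_v_corrected(a, b):
    """Bias-corrected Cramér's V for two equally long sequences of labels."""
    n = len(a)
    if n != len(b) or n < 2:
        raise ValueError("a and b must have the same length (at least 2)")
    joint = Counter(zip(a, b))
    rows, cols = Counter(a), Counter(b)
    r, k = len(rows), len(cols)

    # Pearson chi-squared statistic from observed and expected cell counts.
    chi2 = 0.0
    for row_label, row_total in rows.items():
        for col_label, col_total in cols.items():
            expected = row_total * col_total / n
            observed = joint.get((row_label, col_label), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction of phi^2 and of the numbers of rows and columns.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return float("nan")  # undefined, e.g. when every label is unique
    return math.sqrt(phi2_corr / denom)


if __name__ == "__main__":
    a = ["x"] * 10 + ["y"] * 10
    b = ["p"] * 10 + ["q"] * 10
    print(cramers_v_corrected(a, b))  # perfectly associated labels
```

When both variables consist only of unique values, the corrected denominator becomes zero, which is why the sketch returns NaN in that case.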

Missing values

Sample

First rows

(index) | data | stato | codice_regione | denominazione_regione | lat | long | ricoverati_con_sintomi | terapia_intensiva | totale_ospedalizzati | isolamento_domiciliare | totale_positivi | variazione_totale_positivi | nuovi_positivi | dimessi_guariti | deceduti | casi_da_sospetto_diagnostico | casi_da_screening | totale_casi | tamponi | casi_testati | note
0 | 2020-11-08T17:00:00 | ITA | 13 | Abruzzo | 42.351222 | 13.398438 | 484 | 43 | 527 | 8804 | 9331 | 436 | 584 | 4592 | 596 | 7615 | 6904 | 14519 | 315128 | 193373 | Dal totale dei positivi è stato eliminato 1 caso in quanto paziente non covid.
1 | 2020-11-08T17:00:00 | ITA | 17 | Basilicata | 40.639471 | 15.805148 | 102 | 16 | 118 | 2536 | 2654 | 239 | 246 | 805 | 64 | 1231 | 2292 | 3523 | 115374 | 114414 | IN CORSO REVISIONE DATI PER TAMPONI TOTALI EFFETTUATI E CASI TESTATI E PER TOTALE CASI POSITIVI RESIDENTI IN REGIONE BASILICATA.
2 | 2020-11-08T17:00:00 | ITA | 18 | Calabria | 38.905976 | 16.594402 | 233 | 16 | 249 | 4771 | 5020 | 225 | 359 | 2305 | 140 | 1275 | 6190 | 7465 | 298954 | 295866 | NaN
3 | 2020-11-08T17:00:00 | ITA | 15 | Campania | 40.839566 | 14.250850 | 1817 | 186 | 2003 | 67649 | 69652 | 4146 | 4601 | 16441 | 826 | 84257 | 2662 | 86919 | 1123703 | 778735 | NaN
4 | 2020-11-08T17:00:00 | ITA | 8 | Emilia-Romagna | 44.494367 | 11.341721 | 1836 | 194 | 2030 | 35822 | 37852 | 2203 | 2360 | 28740 | 4816 | 48938 | 22470 | 71408 | 1728477 | 914731 | In seguito a verifica sui dati comunicati nei giorni passati è stato eliminato 1 caso in quanto giudicato non caso COVID-19.
5 | 2020-11-08T17:00:00 | ITA | 6 | Friuli Venezia Giulia | 45.649435 | 13.768136 | 301 | 45 | 346 | 7295 | 7641 | 281 | 504 | 6636 | 451 | 12787 | 1941 | 14728 | 580815 | 242297 | NaN
6 | 2020-11-08T17:00:00 | ITA | 12 | Lazio | 41.892770 | 12.483667 | 2686 | 237 | 2923 | 48455 | 51378 | 2342 | 2489 | 13023 | 1413 | 20086 | 45728 | 65814 | 1646433 | 1341488 | NaN
7 | 2020-11-08T17:00:00 | ITA | 7 | Liguria | 44.411493 | 8.932699 | 1334 | 81 | 1415 | 10993 | 12408 | 300 | 886 | 22106 | 1916 | 25326 | 11104 | 36430 | 486171 | 248706 | NaN
8 | 2020-11-08T17:00:00 | ITA | 3 | Lombardia | 45.466794 | 9.190347 | 6225 | 650 | 6875 | 125535 | 132410 | 4781 | 6318 | 110001 | 18343 | 201314 | 59440 | 260754 | 3256646 | 2003397 | NaN
9 | 2020-11-08T17:00:00 | ITA | 11 | Marche | 43.616760 | 13.518875 | 493 | 69 | 562 | 9679 | 10241 | 414 | 502 | 7498 | 1050 | 18600 | 189 | 18789 | 345496 | 202915 | NaN

Last rows

(index) | data | stato | codice_regione | denominazione_regione | lat | long | ricoverati_con_sintomi | terapia_intensiva | totale_ospedalizzati | isolamento_domiciliare | totale_positivi | variazione_totale_positivi | nuovi_positivi | dimessi_guariti | deceduti | casi_da_sospetto_diagnostico | casi_da_screening | totale_casi | tamponi | casi_testati | note
11 | 2020-11-08T17:00:00 | ITA | 21 | P.A. Bolzano | 46.499335 | 11.356624 | 398 | 39 | 437 | 7550 | 7987 | 507 | 781 | 4798 | 341 | 13126 | 0 | 13126 | 256302 | 130323 | TOTALE TEST ANTIGENICI RAPIDI EFFETTUATI : 341 - TOTALE TEST ANTIGENICI RAPIDI POSITIVI : 62 - TOTALE POSITIVI AL TEST ANTIGENICO E SINTOMATICI : 9
12 | 2020-11-08T17:00:00 | ITA | 22 | P.A. Trento | 46.068935 | 11.121231 | 255 | 20 | 275 | 2478 | 2753 | 135 | 182 | 7668 | 465 | 6004 | 4882 | 10886 | 308944 | 117848 | NaN
13 | 2020-11-08T17:00:00 | ITA | 1 | Piemonte | 45.073274 | 7.680687 | 4367 | 304 | 4671 | 49526 | 54197 | 2708 | 3884 | 38953 | 4629 | 42321 | 55458 | 97779 | 1149489 | 726199 | NaN
14 | 2020-11-08T17:00:00 | ITA | 16 | Puglia | 41.125596 | 16.867367 | 880 | 123 | 1003 | 16184 | 17187 | 614 | 766 | 7677 | 837 | 7183 | 18518 | 25701 | 608364 | 433213 | per 820 casi non e' disponibile la provenienza
15 | 2020-11-08T17:00:00 | ITA | 20 | Sardegna | 39.215312 | 9.110616 | 400 | 55 | 455 | 7972 | 8427 | 356 | 424 | 3579 | 255 | 4760 | 7501 | 12261 | 293794 | 248373 | NaN
16 | 2020-11-08T17:00:00 | ITA | 19 | Sicilia | 38.115697 | 13.362357 | 1250 | 177 | 1427 | 20040 | 21467 | 730 | 1083 | 9128 | 676 | 19378 | 11893 | 31271 | 753094 | 525013 | NaN
17 | 2020-11-08T17:00:00 | ITA | 9 | Toscana | 43.769231 | 11.255889 | 1474 | 226 | 1700 | 41747 | 43447 | 1864 | 2479 | 17938 | 1561 | 50905 | 12041 | 62946 | 1214035 | 799345 | NaN
18 | 2020-11-08T17:00:00 | ITA | 10 | Umbria | 43.106758 | 12.388247 | 353 | 62 | 415 | 9385 | 9800 | 430 | 660 | 4951 | 188 | 4189 | 10750 | 14939 | 329812 | 189320 | NaN
19 | 2020-11-08T17:00:00 | ITA | 2 | Valle d'Aosta | 45.737503 | 7.320149 | 158 | 14 | 172 | 2000 | 2172 | -9 | 55 | 1758 | 205 | 3634 | 501 | 4135 | 45323 | 27399 | NaN
20 | 2020-11-08T17:00:00 | ITA | 5 | Veneto | 45.434905 | 12.338452 | 1355 | 185 | 1540 | 49430 | 50970 | 3313 | 3362 | 25725 | 2574 | 26082 | 53187 | 79269 | 2448503 | 948701 | NaN